Scaling tree-based automated machine learning to biomedical big data with a feature set selector
نویسندگان
چکیده
منابع مشابه
Scaling Datalog for Machine Learning on Big Data
In this paper, we present the case for a declarative foundation for data-intensive machine learning systems. Instead of creating a new system for each specific flavor of machine learning task, or hardcoding new optimizations, we argue for the use of recursive queries to program a variety of machine learning systems. By taking this approach, database query optimization techniques can be utilized...
متن کاملAutomated Machine Learning on Big Data using Stochastic Algorithm Tuning
We introduce a means of automating machine learning (ML) for big data tasks, by performing scalable stochastic Bayesian optimisation of ML algorithm parameters and hyper-parameters. More often than not, the critical tuning of ML algorithm parameters has relied on domain expertise from experts, along with laborious handtuning, brute search or lengthy sampling runs. Against this background, Bayes...
متن کاملLearning ELM-Tree from big data based on uncertainty reduction
A challenge in big data classification is the design of highly parallelized learning algorithms. One solution to this problem is applying parallel computation to different components of a learning model. In this paper, we first propose an extreme learning machine tree (ELM-Tree) model based on the heuristics of uncertainty reduction. In the ELM-Tree model, information entropy and ambiguity are ...
متن کاملVisual Learning by Set Covering Machine with Efficient Feature Selection
In this paper, we propose a new visual learning method for real-world object recognition task. Our method is based on the Set Covering Machine (SCM), to make the learning time shorter than the methods based on commonly used trial-and-error algorithms, such as genetic programming and reinforcement learning. Generally, the process of visual learning is quite time-consuming because image data cons...
متن کاملHow big data changes statistical machine learning
This presentation illustrates how big data forces change on algorithmic techniques and the goals of machine learning, bringing along challenges and opportunities. 1. The theoretical foundations of statistical machine learning traditionally assume that training data is scarce. If one assumes instead that data is abundant and that the bottleneck is the computation time, stochastic algorithms with...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Bioinformatics
سال: 2019
ISSN: 1367-4803,1460-2059
DOI: 10.1093/bioinformatics/btz470